Zisland Explorer: detect genomic islands by combining homogeneity and heterogeneity properties
Abstract
Genomic islands are genomic fragments of alien origin in bacterial and archaeal genomes, usually involved in symbiosis or pathogenesis. In this work, we describe Zisland Explorer, a novel tool that predicts genomic islands from the segmental cumulative GC profile. Zisland Explorer uses a new strategy that combines the homogeneity and heterogeneity of genomic sequences: sequence homogeneity reflects the compositional consistency within each island, whereas heterogeneity measures the compositional bias between an island and the core genome. The performance of Zisland Explorer was evaluated on data sets from 11 different organisms. The results suggest that the true-positive rate (TPR) of Zisland Explorer was at least 10.3% higher than that of four other widely used tools. This improvement in TPR did not come at the cost of overall accuracy, and the tool showed a better balance among the various evaluation indexes. Zisland Explorer was also more accurate in predicting experimental island data. Overall, the tool provides an alternative to other tools, which expands the field of island prediction and offers a supplement that can increase the performance of distinct prediction strategies. A web service, a graphical user interface, and cross-platform open-source code for Zisland Explorer are available at http://cefg.uestc.edu.cn/Zisland_Explorer/ or http://tubic.tju.edu.cn/Zisland_Explorer/.
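The two properties named in the abstract can be illustrated with a small Python sketch. It is not the Zisland Explorer implementation: the function names, the end-point detrending of the cumulative profile, and the use of window variance as a homogeneity measure are all assumptions chosen for illustration only.

```python
from statistics import pvariance


def gc_content(seq):
    """Fraction of G and C among the A/C/G/T bases of seq."""
    seq = seq.upper()
    acgt = sum(seq.count(b) for b in "ACGT")
    return (seq.count("G") + seq.count("C")) / acgt if acgt else 0.0


def cumulative_gc_profile(seq):
    """Running (G+C) - (A+T) count, detrended by its overall mean slope.

    Assumption: a simplified stand-in for a cumulative GC profile.
    Regions richer in GC than the whole sequence rise; poorer regions fall.
    """
    running, raw = 0, []
    for base in seq.upper():
        if base in "GC":
            running += 1
        elif base in "AT":
            running -= 1
        raw.append(running)
    n = len(raw)
    slope = raw[-1] / n if n else 0.0
    return [raw[i] - slope * (i + 1) for i in range(n)]


def heterogeneity(genome, start, end):
    """Illustrative measure: absolute GC difference, segment vs. whole genome."""
    return abs(gc_content(genome[start:end]) - gc_content(genome))


def homogeneity(genome, start, end, window=10):
    """Illustrative measure: variance of GC content over full windows in the
    segment. Lower values mean a more uniform composition."""
    segment = genome[start:end]
    values = [
        gc_content(segment[i:i + window])
        for i in range(0, len(segment) - window + 1, window)
    ]
    return pvariance(values) if values else 0.0


# Toy genome: a GC-rich block (positions 100..139) inside an AT-only background.
genome = "AT" * 50 + "GC" * 20 + "AT" * 50
profile = cumulative_gc_profile(genome)
island_start = profile.index(min(profile)) + 1  # profile turns upward here
island_end = profile.index(max(profile)) + 1    # and turns downward here
```

In this toy case the profile's minimum and maximum bracket the inserted block, the block differs strongly in GC content from the whole sequence (high heterogeneity), and its windows are compositionally uniform (zero variance). Real genomes are far noisier, so a practical tool needs segmentation and thresholds that this sketch does not attempt.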
Similar resources
Predicting CpG Islands and Their Relationship with Genomic Feature in Cattle by Hidden Markov Model Algorithm
Cattle supply an important source of nutrition for humans in the world. CpG islands (CGIs) are very important and useful, as they carry functionally relevant epigenetic loci for whole genome studies. As a matter of fact, there have been no formal analyses of CGIs at the DNA sequence level in cattle genomes and therefore this study was carried out to fill the gap. We used hidden markov model alg...
Molecular Detection of Genomic Islands Associated With Class 1 and 2 Integron in Haemophilus influenzae Isolated in Iran
BACKGROUND High levels of multidrug resistance are usually associated with mobile genetic elements that encode specific resistance genes. Integrons are important genetic elements involved in spreading antibiotic multi-resistance. In special cases, large exogenous segments in bacterial genomes form genomic islands, and one of the functions of these genomic islands is antibiotic resistance. Due t...
A systematic method to identify genomic islands and its applications in analyzing the genomes of Corynebacterium glutamicum and Vibrio vulnificus CMCP6 chromosome I
MOTIVATION Some genomic islands contain horizontally transferred genes, which play critical roles in altering the genotypes and phenotypes of organisms, and horizontal gene transfer has been recognized as a universal event throughout bacterial evolution. A windowless method to display the distribution of genomic GC content, the cumulative GC profile, is proposed to identify genomic islands in g...
Genomic homogeneity in fibrolamellar carcinomas.
BACKGROUND Fibrolamellar carcinoma (FLC) is a variant of hepatocellular carcinoma (HCC) with distinctive clinical and histological features. To date there have been few studies on the genotypic aspects of FLC and no previous attempts have been made to use the arbitrarily primed-polymerase chain reaction (AP-PCR) technique to detect genetic alterations in this disease. AIM The aim of this stud...
Integrative analysis of multiple cancer genomic datasets under the heterogeneity model.
In the analysis of cancer studies with high-dimensional genomic measurements, integrative analysis provides an effective way of pooling information across multiple heterogeneous datasets. The genomic basis of multiple independent datasets, which can be characterized by the sets of genomic markers, can be described using the homogeneity model or heterogeneity model. Under the homogeneity model, ...